(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192
Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG CTC GCA GCT 240
Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC GTC GAT TTG 288
Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

CTC GTT GGG GCG GCT GCT CTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336
Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC TCG CCT CGC 384
Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile Ser Pro Arg
115 120 125

CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC 432
Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT 480
Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro
145 150 155 160

ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC 528
Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val
165 170 175

GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC 576
Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala
180 185 190

TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA 624
Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu
195 200 205

CTC TTT GCT CTC TAATAG 642
Leu Phe Ala Leu
210



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met
20 25 30

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile Ser Pro Arg
115 120 125

Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro
145 150 155 160

Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val
165 170 175

Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala
180 185 190

Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu
195 200 205

Leu Phe Ala Leu
210



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..792

(ix) FEATURE:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 1..789

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672
Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro
210 215 220

CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720
Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala
225 230 235 240

GGT CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT 768
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile
245 250 255

GTG ATG CTA CTC TTT GCT CCC TAATAG 795
Val Met Leu Leu Phe Ala Pro
260

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 263 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro
210 215 220

Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala
225 230 235 240

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile
245 250 255

Val Met Leu Leu Phe Ala Pro
260



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 633 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..630

(ix) FEATURE:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 1..627



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

ATG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGT GCT GCC AGA 96
Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACT ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

CCC GGC CAC ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

TGG TAATAG 633
210



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 483 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..480

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..477



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48 
Met Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96 



HCVENV.-T 






Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192
Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288
Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384
Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432
Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCC TAATAG 483
Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser
145 150 155 160

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser
145 150 155



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..477

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..474



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATG TCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48
Met Ser Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96
Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192
Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288
Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384
Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432
Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 480
Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
145 150 155

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 158 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Ser Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
145 150 155

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..633 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..630



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATG CTG GGT AAG GCC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Ala Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTA ACC ATT CCA GCT TCC GCT TAC GAG GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCG GCT AGG AAC GCC AGC ATC CCC ACT ACA ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Ile Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

TGG TAC TAATAG 636
Trp Tyr
210



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Leu Gly Lys Ala Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Ile Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

Trp Tyr
210



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATGCCCGGTT GCTCTTTCTC TATCTT 26 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTTGGGTA AGGTCATCGA TACCCT 26 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 






(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

CTATTAGGAC CAGTTCATCA TCATATCCCA 30

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CTATTACCAG TTCATCATCA TATCCCA 27
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC 36

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: YES



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT 36 
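
SEQ ID NO: 19 is marked ANTI-SENSE: NO and SEQ ID NO: 20 is marked ANTI-SENSE: YES, and both are 36 bases long. A minimal sketch, assuming plain uppercase A/C/G/T strings copied from the two entries (the helper name `reverse_complement` and the constant names are illustrative and not part of the listing), checks whether the second entry is the reverse complement of the first:

```python
# Illustrative consistency check; sequences are copied from SEQ ID NO: 19 and 20.
# The function and variable names are assumptions made for this sketch.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


SEQ_ID_19 = "ATACGACGCCACGTCGATTCCCAGCTGTTCACCATC"
SEQ_ID_20 = "GATGGTGAACAGCTGGGAATCGACGTGGCGTCGTAT"

if __name__ == "__main__":
    # Both primers should have the length stated in their headers.
    print(len(SEQ_ID_19), len(SEQ_ID_20))
    # True when SEQ ID NO: 20 is the exact reverse complement of SEQ ID NO: 19.
    print(reverse_complement(SEQ_ID_19) == SEQ_ID_20)
```

The same helper can be pointed at any other sense/anti-sense pair in the listing; it only compares strings and says nothing about how the primers were used.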

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 723 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..720

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..717



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 576
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG 624
Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala
195 200 205

GGG GCC CAT TGG GGA GTC CTG GCG GGT CTC GCC TAC [...] TCC ATG GTG 672
Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val
210 215 220

GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA CTC TTT GCT CCC TAATAG 723 
Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Pro
225 230 235 240 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala
195 200 205

Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val
210 215 220

Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Pro
225 230 235



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..558

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..555



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 561
Met Ala Trp Asp Met Met Met Asn Trp
180 185
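
Each CDS entry pairs a codon row with the three-letter residues it encodes, so any row can be checked against the standard genetic code. A minimal sketch, assuming the standard nuclear code and one-letter residue symbols (the names `CODON_TABLE` and `translate` are illustrative and not part of the listing), translates the final codon row of SEQ ID NO: 23:

```python
# Illustrative check of one codon row against the standard genetic code.
# All names here are assumptions made for this sketch.
from itertools import product

BASES = "TCAG"
# Standard code in T, C, A, G order for each codon position; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) codon by codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Final codon row of SEQ ID NO: 23, including the two stop codons (TAA TAG).
LAST_ROW = "ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG"

if __name__ == "__main__":
    print(translate(LAST_ROW))
```

In one-letter code the residue row Met Ala Trp Asp Met Met Met Asn Trp reads MAWDMMMNW, and the two trailing stop codons translate to `*`.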



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp
180 185



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..603

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..600



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

GTA TCG CAG CTG CTC CGG ATC CTC TAATAG
Val Ser Gln Leu Leu Arg Ile Leu
195 200



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

Val Ser Gln Leu Leu Arg Ile Leu
195 200
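
The codon rows and the three-letter residue rows of a nucleotide/protein pair such as SEQ ID NO: 25 and SEQ ID NO: 26 can be cross-checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the listing.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for the first, second and third base respectively.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# One-letter to three-letter residue codes; "*" marks a stop codon.
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}

CODON_TABLE = {
    a + b + c: ONE_TO_THREE[AMINO_1[16 * i + 4 * j + k]]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str) -> list[str]:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(nucleotides.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "Stop":
            break
        residues.append(residue)
    return residues


# First codon row of SEQ ID NO: 25, copied from the listing.
row = "ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC"
print(" ".join(translate(row)))
```

Comparing the printed residues with the corresponding row of the protein sequence is a quick way to spot transcription errors in either row.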



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..633

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..630

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 576
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

GTA TCG CAG CTG CTC CGG ATC GTG ATC GAG GGC AGA CAC CAT CAC CAC 624
Val Ser Gln Leu Leu Arg Ile Val Ile Glu Gly Arg His His His His
195 200 205

CAT CAC TAATAG 636
His His
210



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

Val Ser Gln Leu Leu Arg Ile Val Ile Glu Gly Arg His His His His
195 200 205

His His
210



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..627

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..624

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

ATG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGA TTC GCC GAT CTC ATG 48
Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

GGG TAC ATC CCG CTC GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC 96
Gly Tyr Ile Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala
20 25 30

CTT GCG CAT GGC GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA 144
Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr
35 40 45

GGG AAT TTG CCC GGT TGC TCC TTT TCT ATT TTC CTT CTC GCT CTG TTC 192
Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe
50 55 60

TCT TGC TTA ATT CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT 240
Ser Cys Leu Ile His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser
65 70 75 80

GGC CTC TAT GTC CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC 288
Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr
85 90 95

GAG GCC GAT GAC GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC 336
Glu Ala Asp Asp Val Ile Leu His Thr Pro Gly Cys Ile Pro Cys Val
100 105 110

CAG GAC GGC AAT ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG 384
Gln Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val
115 120 125

GCA GTC AAG TAC GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 432
Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser Ile Arg Ser His Val
130 135 140

GAC CTA TTA GTG GGC GCG GCC ACG ATG TGC TCT GCG CTC TAC GTG GGT 480
Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

GAC ATG TGT GGG GCT GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA 528
Asp Met Cys Gly Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg
165 170 175

CCT CGT CGC CAT CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA 576
Pro Arg Arg His Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro
180 185 190

GGC CAT CTT TCA GGA CAT CGA ATG GCT TGG GAT ATG ATG ATG AAC TGG 624
Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205

TAATAG 630



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

Gly Tyr Ile Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala
20 25 30

Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr
35 40 45

Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe
50 55 60

Ser Cys Leu Ile His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser
65 70 75 80

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr
85 90 95

Glu Ala Asp Asp Val Ile Leu His Thr Pro Gly Cys Ile Pro Cys Val
100 105 110

Gln Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val
115 120 125

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser Ile Arg Ser His Val
130 135 140

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg
165 170 175

Pro Arg Arg His Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro
180 185 190

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..627

(ix) FEATURE:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 1..624

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATG GGT AAG GTC ATC GAT ACC CTA ACG TGC GGA TTC GCC GAT CTC ATG 48
Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

GGG TAT ATC CCG CTC GTA GGC GGC CCC ATT GGG GGC GTC GCA AGG GCT 96
Gly Tyr Ile Pro Leu Val Gly Gly Pro Ile Gly Gly Val Ala Arg Ala
20 25 30

CTC GCA CAC GGT GTG AGG GTC CTT GAG GAC GGG GTA AAC TAT GCA ACA 144
Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr
35 40 45

GGG AAT TTA CCC GGT TGC TCT TTC TCT ATC TTT ATT CTT GCT CTT CTC 192
Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Ile Leu Ala Leu Leu
50 55 60

TCG TGT CTG ACC GTT CCG GCC TCT GCA GTT CCC TAC CGA AAT GCC TCT 240
Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser
65 70 75 80

GGG ATT TAT CAT GTT ACC AAT GAT TGC CCA AAC TCT TCC ATA GTC TAT 288
Gly Ile Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr
85 90 95

GAG GCA GAT AAC CTG ATC CTA CAC GCA CCT GGT TGC GTG CCT TGT GTC 336
Glu Ala Asp Asn Leu Ile Leu His Ala Pro Gly Cys Val Pro Cys Val
100 105 110

ATG ACA GGT AAT GTG AGT AGA TGC TGG GTC CAA ATT ACC CCT ACA CTG 384
Met Thr Gly Asn Val Ser Arg Cys Trp Val Gln Ile Thr Pro Thr Leu
115 120 125

TCA GCC CCG AGC CTC GGA GCA GTC ACG GCT CCT CTT CGG AGA GCC GTT 432
Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val
130 135 140

GAC TAC CTA GCG GGA GGG GCT GCC CTC TGC TCC GCG TTA TAC GTA GGA 480
Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

GAC GCG TGT GGG GCA CTA TTC TTG GTA GGC CAA ATG TTC ACC TAT AGG 528
Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gln Met Phe Thr Tyr Arg
165 170 175

CCT CGC CAG CAC GCT ACG GTG CAG AAC TGC AAC TGT TCC ATT TAC AGT 576
Pro Arg Gln His Ala Thr Val Gln Asn Cys Asn Cys Ser Ile Tyr Ser
180 185 190

GGC CAT GTT ACC GGC CAC CGG ATG GCA TGG GAT ATG ATG ATG AAC TGG 624
Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205

TAATAG 630



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 208 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

Gly Tyr Ile Pro Leu Val Gly Gly Pro Ile Gly Gly Val Ala Arg Ala
20 25 30

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr
35 40 45

Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Ile Leu Ala Leu Leu
50 55 60

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser
65 70 75 80

Gly Ile Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr
85 90 95



Glu Ala Asp Asn Leu Ile Leu His Ala Pro Gly Cys Val Pro Cys Val
100 105 110

Met Thr Gly Asn Val Ser Arg Cys Trp Val Gln Ile Thr Pro Thr Leu
115 120 125

Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val
130 135 140

Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gln Met Phe Thr Tyr Arg
165 170 175

Pro Arg Gln His Ala Thr Val Gln Asn Cys Asn Cys Ser Ile Tyr Ser
180 185 190

Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205



(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

TGGGATATGA TGATGAACTG GTC

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

CTATTATGGT GGTAAGCCAC AGAGCAGGAG 30

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1473

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..1470



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG 48
Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser
1 5 10 15

CAG CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC 96
Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala
20 25 30

CAT TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC 144
His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn
35 40 45

TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG 192
Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly
50 55 60

CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT 240
His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu
65 70 75 80

GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC 288
Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn
85 90 95

ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC 336
Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp
100 105 110

TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC 384
Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe
115 120 125

AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC 432
Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp
130 135 140

AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC 480
Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser
145 150 155 160

TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT 528
Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly
165 170 175

ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG 576
Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro
180 185 190

AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT 624
Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr
195 200 205

AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG 672
Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg
210 215 220

CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG 720
Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly
225 230 235 240

TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC 768
Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly
245 250 255

AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG 816
Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu
260 265 270

GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT 864
Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys
275 280 285

ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC 912
Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn
290 295 300

TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG 960
Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg
305 310 315 320

TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG 1008
Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu
325 330 335

GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG 1056
Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu
340 345 350

TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC 1104
Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr
355 360 365

GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC 1152
Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr
370 375 380



GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC 1200
Gly Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val
385 390 395 400

CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA 1248
Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu
405 410 415

TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG 1296
Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu
420 425 430

GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC 1344
Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser
435 440 445

TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC 1392
Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val
450 455 460

CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT 1440
Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu
465 470 475 480

CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 1476
Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala
485 490

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 490 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser
1 5 10 15

Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala
20 25 30

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn
35 40 45

Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly
50 55 60

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu
65 70 75 80

Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn
85 90 95

Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp
100 105 110

Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe
115 120 125

Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp
130 135 140

Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser
145 150 155 160

Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly
165 170 175

Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro
180 185 190

Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr
195 200 205

Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg
210 215 220

Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly
225 230 235 240

Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly
245 250 255

Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu
260 265 270

Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys
275 280 285

Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn
290 295 300

Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg
305 310 315 320

Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu
325 330 335

Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu
340 345 350

Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr
355 360 365

Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr
370 375 380

Gly Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val
385 390 395 400

Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu
405 410 415

Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu
420 425 430

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser
435 440 445

Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val
450 455 460

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu
465 470 475 480

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala
485 490
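
The stated lengths in this listing are related by simple arithmetic: for SEQ ID NO: 35 the mat_peptide location 1..1470 spans 1470 nucleotides, which corresponds to the 490 amino acids stated for SEQ ID NO: 36, and the CDS location 1..1473 adds one stop codon. The following is a minimal sketch of that check; the function names `span_length` and `residue_count` are illustrative and not part of the listing.

```python
def span_length(location: str) -> int:
    """Return the number of nucleotides covered by a location such as '1..1470'."""
    start, end = (int(part) for part in location.replace(" ", "").split(".."))
    return end - start + 1


def residue_count(location: str) -> int:
    """Return the number of codons in a location; the span must be a multiple of 3."""
    length = span_length(location)
    if length % 3:
        raise ValueError(f"{location!r} covers {length} nt, not a whole number of codons")
    return length // 3


# Locations copied from the feature table of SEQ ID NO: 35.
print(residue_count("1..1470"))  # mat_peptide
print(residue_count("1..1473"))  # CDS, including one stop codon
```

The same check applies to locations that do not start at position 1, such as the mat_peptide location 2..1015 of SEQ ID NO: 37 and the 338 amino acids stated for SEQ ID NO: 38.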

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 2..1018

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 2..1015

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 46
Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly
1 5 10 15

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys
20 25 30

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg
35 40 45

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu
50 55 60

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238
Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly
65 70 75

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 286
Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln
80 85 90 95

ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT 334
Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser
100 105 110

GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT 382
Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala
115 120 125

CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG 430
Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln
130 135 140

AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC 478
Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro
145 150 155

GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT 526
Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val
160 165 170 175

GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG 574
Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly
180 185 190

GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA 622
Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg
195 200 205

GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG 670
Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys
210 215 220

ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC 718
Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr
225 230 235

TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC 766
Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr
240 245 250 255

GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT 814
Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His
260 265 270

TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC 862
Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile
275 280 285

TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC 910
Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala
290 295 300

GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT 958
Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp
305 310 315

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG AGT 1006
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln Ser
320 325 330 335

GGC AGA GCT TAATTA 1021
Gly Arg Ala



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val
1 5 10 15

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val
20 25 30

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val
35 40 45

Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe
50 55 60

Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser
65 70 75 80

Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr
85 90 95

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly
100 105 110

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln
115 120 125

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg
130 135 140

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro Ala
145 150 155 160

Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val
165 170 175

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala
180 185 190

Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg Gly
195 200 205

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr
210 215 220

Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr Leu
225 230 235 240

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala
245 250 255

Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr
260 265 270

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile Phe
275 280 285

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala
290 295 300

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg
305 310 315 320

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln Ser Gly
325 330 335

Arg Ala

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1034 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..1032

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 2..1029



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 46

Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly
1 5 10 15

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60

305 310 315

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GGT GAT CGA GGG 1006

Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly

320 325 330 335

CAG ACA CCA TCA CCA CCA TCA CTA AT AG 1034

Gln Thr Pro Ser Pro Pro Ser Leu
340 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val
1 5 10 15

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val
20 25 30

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val
35 40 45

Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe
50 55 60

Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser
65 70 75 80

Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr
85 90 95

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly
100 105 110

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln
115 120 125

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg
130 135 140

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro Ala
145 150 155 160

Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val
165 170 175

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala
180 185 190

Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg Gly
195 200 205

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr
210 215 220

Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr Leu
225 230 235 240

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala
245 250 255

Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr
260 265 270

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile Phe
275 280 285

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala
290 295 300

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg
305 310 315 320

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly Gln
325 330 335

Thr Pro Ser Pro Pro Ser Leu
340



(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 945 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..942

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..939



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala
1 5 10 15

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp
20 25 30

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile
35 40 45

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192
Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu
50 55 60

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 240
Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr
65 70 75 80

AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 288
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys
85 90 95

CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 336
Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr
100 105 110

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 384
Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro
115 120 125

CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432
Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr
130 135 140

TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480
Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly
145 150 155 160

GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528
Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu
165 170 175

AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 576
Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met
180 185 190

AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 624
Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile
195 200 205

GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 672
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg
210 215 220

AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 720
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu
225 230 235 240

ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 768
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro
245 250 255

TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 816
Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly
260 265 270

GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 864
Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg
275 280 285

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu
290 295 300

TCT ACA ACA GAG TGG CAG AGC TTA ATT AAT TAG 945
Ser Thr Thr Glu Trp Gln Ser Leu Ile Asn
305 310

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala
1 5 10 15

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp
20 25 30

Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile
35 40 45

Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu
50 55 60

Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr
65 70 75 80

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys
85 90 95

Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr
100 105 110

Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro
115 120 125

Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr
130 135 140

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly
145 150 155 160

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu
165 170 175

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met
180 185 190

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile
195 200 205

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg
210 215 220

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu
225 230 235 240

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro
245 250 255

Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly
260 265 270

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg
275 280 285

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu
290 295 300

Ser Thr Thr Glu Trp Gln Ser Leu Ile Asn
305 310

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 961 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..958

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..955



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala
1 5 10 15

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp
20 25 30

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile
35 40 45

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192
Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu
50 55 60

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 240
Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr
65 70 75 80

AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 288
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys
85 90 95

CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 336
Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr
100 105 110

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 384
Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro
115 120 125

CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432
Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr
130 135 140

TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480
Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly
145 150 155 160

GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528
Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu
165 170 175

AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 576
Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met
180 185 190

AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 624
Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile
195 200 205

GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 672
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg
210 215 220

AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 720
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu
225 230 235 240

ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 768
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro
245 250 255

TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 816
Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly
260 265 270

GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 864
Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg
275 280 285

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu
290 295 300

TCT ACA ACA GGT GAT CGA GGG CAG ACA CCA TCA CCA CCA TCA CTA A 958
Ser Thr Thr Gly Asp Arg Gly Gln Thr Pro Ser Pro Pro Ser Leu
305 310 315

TAG 961



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala
1 5 10 15

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp
20 25 30

Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile
35 40 45

Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu
50 55 60

Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr
65 70 75 80

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys
85 90 95

Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr
100 105 110

Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro
115 120 125

Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr
130 135 140

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly
145 150 155 160

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu
165 170 175

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met
180 185 190

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile
195 200 205

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg
210 215 220

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu
225 230 235 240

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro
245 250 255

Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly
260 265 270

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg
275 280 285

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu
290 295 300

Ser Thr Thr Gly Asp Arg Gly Gln Thr Pro Ser Pro Pro Ser Leu
305 310 315

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1392

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..1389



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT 48
Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr
1 5 10 15

TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT 96
Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe
20 25 30

GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC 144
Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser
35 40 45

GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA 192
Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys
50 55 60

ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC 240
Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala
65 70 75 80

CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC 288
Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe
85 90 95

TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC 336
Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser
100 105 110

TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC 384
Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr
115 120 125

ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG 432
Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala
130 135 140

CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG 480
Pro Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val
145 150 155 160

TAT TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT 528
Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe
165 170 175

GGT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT 576
Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile
180 185 190

CTC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG 624
Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp
195 200 205

ATG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC 672
Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn
210 215 220

ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT 720
Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe
225 230 235 240

CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG 768
Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp
245 250 255

CTG ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC 816
Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr
260 265 270

CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG 864
Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly
275 280 285

GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG 912
Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu
290 295 300

CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG 960
Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu
305 310 315 320

CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG 1008
Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu
325 330 335

CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC 1056
Pro Ala Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp
340 345 350

GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC 1104
Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val Ile
355 360 365

AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC 1152
Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg
370 375 380

ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC 1200
Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala
385 390 395 400

GCC TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG 1248
Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala
405 410 415

CAT GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC 1296
His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile
420 425 430

AAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG 1344
Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp
435 440 445

CCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 1395
Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala
450 455 460

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 463 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr
1 5 10 15

Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe
20 25 30

Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser
35 40 45

Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys
50 55 60

Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala
65 70 75 80

Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe
85 90 95

Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser
100 105 110

Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr
115 120 125

Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala
130 135 140

Pro Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val
145 150 155 160

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe
165 170 175

Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile
180 185 190

Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp
195 200 205

Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn
210 215 220

Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe
225 230 235 240

Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp
245 250 255

Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr
260 265 270

Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly
275 280 285

Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu
290 295 300

Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu
305 310 315 320

Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu
325 330 335

Pro Ala Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp
340 345 350

Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val Ile
355 360 365

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg
370 375 380

Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala
385 390 395 400

Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala
405 410 415

His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile
420 425 430

Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp
435 440 445

Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala
450 455 460



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2079

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..2076



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95



TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672
Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro
210 215 220

CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720
Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala
225 230 235 240

GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT 768
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val
245 250 255

GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA 816
Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly
260 265 270

GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC 864
Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro
275 280 285

GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC 912
Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His
290 295 300

ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC 960
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe
305 310 315 320

TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA 1008
Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro
325 330 335

GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG 1056
Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp
340 345 350

GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC 1104
Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr
355 360 365

TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG 1152
Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro Ala Ser Gln
370 375 380

GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG 1200
Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly
385 390 395 400

ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC 1248
Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp
405 410 415

TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG 1296
Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp
420 425 430

TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG 1344
Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly
435 440 445

GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC 1392
Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys
450 455 460

CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC 1440
Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys
465 470 475 480

GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT 1488
Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr
485 490 495

AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT 1536
Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val
500 505 510

AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT 1584
Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn
515 520 525

TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG 1632
Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu
530 535 540

CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT 1680
Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys
545 550 555 560

TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT 1728
Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu Ile His Leu His
565 570 575

CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT 1776
Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val
580 585 590



GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC 1824
Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu
595 600 605

CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA 1872
Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile
610 615 620

GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG 1920
Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala
625 630 635 640

GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT 1968
Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys
645 650 655

GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC 2016
Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala
660 665 670

TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA 2064
Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro
675 680 685

CGA GCT TAT GCC TAGTAA 2082
Arg Ala Tyr Ala
690



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 692 amino acids
(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro
210 215 220

Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala
225 230 235 240

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val
245 250 255

Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly
260 265 270

Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro
275 280 285

Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His
290 295 300

Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe
305 310 315 320

Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro
325 330 335

Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp
340 345 350

Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr
355 360 365

Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro Ala Ser Gln
370 375 380

Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly
385 390 395 400

Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp
405 410 415

Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp
420 425 430

Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly
435 440 445

Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys
450 455 460

Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys
465 470 475 480

Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr
485 490 495

Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val
500 505 510

Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn
515 520 525

Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu
530 535 540

Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys
545 550 555 560

Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu Ile His Leu His
565 570 575

Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val
580 585 590

Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu
595 600 605

Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile
610 615 620

Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala
625 630 635 640

Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys
645 650 655

Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala
660 665 670

Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro
675 680 685

Arg Ala Tyr Ala
690

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..2430

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..2427



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

CGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT GGT
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC CTG TTG CCG CGC AGG GGC CCC AGG TTG GGT GTG CGC GCG
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACT AGG AAG ACT TCC GAG CGG TCG CAA CCT CGT GGG AGG CGA CAA CCT
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGC CGA CCC GAG GGT AGG GCC TGG GCT CAG CCC GGG
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC ATG GGG TGG GCA GGA TGG
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCA CCC CGC GGC TCT CGG CCT AGT TGG GGC CCT ACA GAC CCC
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA
Gly Phe Ala Asp Leu Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

GGG GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

GGC GTG AAC TAT GCA ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

TTC CTC TTG GCT TTG CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

GAA GTG CGC AAC GTG TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC 624
Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 

AAC TCA AGC ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC 672 
Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro
210 215 220 

GGG TGC GTG CCC TGC GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA 720 
Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

GCG CTC ACC CCC ACG CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG 768

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 250 255 

ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT 816

Thr Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys
260 265 270

TCC GCT ATG TAC GTG GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC 864

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser
275 280 285

CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC 912 
Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys
290 295 300 

AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG 960
Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG 1008 
Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln
325 330 335 

CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT 1056

Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His
340 345 350

TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG 1104 
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT 1152 
Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His
370 375 380 

ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG 1200 
Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 
385 390 395 400 

TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC 1248

Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr
405 410 415 

AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC 1296 
Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430 



CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC 1344
Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn
435 440 445

TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG 1392
Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys
450 455 460

TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG 1440
Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser
465 470 475 480

GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT 1488
Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile
485 490 495

GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC 1536
Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
500 505 510

CCT GT1 GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC 1584
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn
515 520 525

TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG 1632
Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro
530 535 540

CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC 1680
Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe
545 550 555 560

ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC 1728
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn
565 570 575

AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC 1776
Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala
580 585 590

ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG 1824
Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met
595 600 605

GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC 1872
Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
610 615 620

ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC 1920
Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe
625 630 635 640

GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC 1968
Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp
645 650 655

AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG 2016
Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
660 665 670



CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC 2064
Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly
675 680 685

CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT 2112
Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
690 695 700

GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC CTG 2160
Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu
705 710 715 720

TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA TGG 2208
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp
725 730 735

ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG GTG 2256
Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
740 745 750

GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC TTC 2304
Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe
755 760 765

CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC CCT 2352
Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro
770 775 780

GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT CTG 2400
Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu
785 790 795 800

CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 2433
Leu Ala Leu Pro Pro Arg Ala Tyr Ala

805 810 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 809 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser
195 200 205

Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro
210 215 220

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val
225 230 235 240

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr
245 250 255

Thr Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys
260 265 270

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser
275 280 285

Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys
290 295 300

Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln
325 330 335

Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His
340 345 350

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp
355 360 365

Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His
370 375 380

Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val
385 390 395 400

Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr
405 410 415

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430

Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn
435 440 445

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys
450 455 460

Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser
465 470 475 480

Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile
485 490 495

Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
500 505 510

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn
515 520 525

Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro
530 535 540

Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe
545 550 555 560

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn
565 570 575

Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala
580 585 590

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met
595 600 605

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
610 615 620

Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe
625 630 635 640

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp
645 650 655

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
660 665 670

Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly
675 680 685

Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
690 695 700

Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu
705 710 715 720



Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp
725 730 735

Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
740 745 750

Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe
755 760 765

Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro
770 775 780

Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu
785 790 795 800

Leu Ala Leu Pro Pro Arg Ala Tyr Ala
805



(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 1..17



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ser Asn Ser Ser Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys
1 5 10 15

Val 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Gly Gly Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
1 5 10 15



Ser Pro Thr Thr Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..37 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Tyr Glu Val Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys
1 5 10 15

Ser Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr
20 25 30 

Pro Gly Cys Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
1 5 10 15

Gln Leu Arg Arg His Ile Asp Leu Leu
20 25 

(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids
(B) TYPE: amino acid



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly Gly Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr 
1 5 10 15

Thr Ile Arg Arg His Val Asp Leu Leu
20 25 



(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gln Val Arg Asn
1 5 10 15 

Ser Thr Gly Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
1 5 10 15

Asn Ser Ser Ile
20 

(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala His Asp Ala Ile
1 5 10 15

Leu His Thr Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

Ser Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr
1 5 10 15

Pro Gly Cys Val 
20 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

His Asp Ala Ile Leu His Thr Pro Gly Val Pro Cys Val Arg Glu Gly
1 5 10 15

Asn Val Ser 

(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Val Arg Glu Gly Asn Val Ser Arg Cys Trp Val Ala Met Thr Pro 
1 5 10 15

Thr Val Ala Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr
1 5 10 15

Gln Leu Arg Arg
20 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Pro Ala Thr Gln Leu Arg Arg His Ile Asp Leu Leu Val Gly Ser
1 5 10 15 

Ala Thr Leu Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 




(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu 
1 5 10 15

Cys Gly Ser Val
20



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Gln Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gln Gly Cys
1 5 10 15

Asn Cys Ser Ile
20



(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Gln Gly Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His
1 5 10 15

Arg Met Ala Trp
20



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro
1 5 10 15 

Thr Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Asn Trp Ser Pro Thr Ala Ala Leu Val Met Ala Gln Leu Leu Arg Ile
1 5 10 15

Pro Gln Ala Ile
20

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Leu Leu Arg Ile Pro Gln Ala Ile Leu Asp Met Ile Ala Gly Ala His
1 5 10 15

Trp Gly Val Leu
20 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Ala Gly Ala His Trp Gly Val Leu Ala Gly Ile Ala Tyr Phe Ser Met
1 5 10 15

Val Gly Asn Met 
20 



(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr Ile Val Ser
1 5 10 15

Gly Gly Gin Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser Gly Leu Val Ser Leu Phe Thr Pro Gly Ala Lys Gln Asn Ile Gln
1 5 10 15

Leu Ile Asn Thr
20 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Gln Trp His Ile Asn Ser
1 5 10 15

Thr Ala Leu Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Trp Leu Ala Gly Leu
1 5 10 15

Ile Tyr Gln His Lys

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala Gly Leu Ile Tyr Gln His Lys Phe Asn Ser Ser Gly Cys Pro Glu
1 5 10 15

Arg Leu Ala Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu Thr Asp Phe Asp 
1 5 10 15

Gln Gly Trp Gly
20 

(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Thr Asp Phe Asp Gln Gly Trp Gly Pro Ile Ser Tyr Ala Asn Gly Ser
1 5 10 15

Gly Pro Asp Gln
20 

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ala Asn Gly Ser Gly Pro Asp Gln Arg Pro Tyr Cys Trp His Tyr Pro
1 5 10 15

Pro Lys Pro Cys 

20 

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

Trp His Tyr Pro Pro Lys Pro Cys Gly Ile Val Pro Ala Lys Ser Val
1 5 10 15

Cys Gly Pro Val 
20 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

Ala Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val
1 5 10 15

Val Val Gly Thr
20

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr 
1 5 10 15

Tyr Ser Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

Gly Ala Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val
1 5 10 15

Leu Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys
1 5 10 15

Val Cys Gly Ala
20

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Val Cys Ile Gly Gly Ala
1 5 10 15

Gly Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



Ile Gly Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Arg
1 5 10 15

Lys His Pro 



(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly
1 5 10 15

Ser Gly Pro Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

Ser Arg Cys Gly Ser Gly Pro Trp Ile Thr Pro Arg Cys Leu Val Asp
1 5 10 15

Tyr Pro Tyr Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Ile
1 5 10 15

Asn Tyr Thr Ile
20



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

Pro Cys Thr Ile Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly
1 5 10 15

Gly Val Glu His
20



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp 
1 5 10 15

Thr Pro Gly Glu
20



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Cys Asn Trp Thr Pro Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
1 5 10 15

Arg Ser Glu Leu
20



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr
1 5 10 15

Gln Trp Gln Val
20

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Tyr Gln Val Arg Asn Ser Thr Gly Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ACGTCCGTAC GTTCGAATTA ATTAATCGA 29



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

-SCTCCGGACG TGCACTAGCT CCCGTCTGTG GTAGTGGTGG TAGTGATTAT CAATTAATTG 60



(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTTAACCAC TGCATGATG 19
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:
GTCCCATCGA GTGCGGCTAC



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CGTGACATGG TACATTCCGG ACACTTGGCG CACTTCATAA GCGGA
(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:
TGCCTCATAC ACAATGGAGC TCTGGGACGA GTCGTTCGTG AC 
(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:
TACCCAGCAG CGGGAGCTCT GTTGCTCCCG AACGCAGGGC AC 42



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

TGTCGTGGTG GGGACGGAGG CCTGCCTAGC TGCGAGCGTG GG 42
(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CGTTATGTGG CCCGGGTAGA TTGAGCACTG GCAGTCCTGC ACCGTCTC 48 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CAGGGCCGTT CTAGGCCTCC ACTGCATCAT CATATCCCAA GC 42



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

CCGGAATGTA CCATGTCACG AACGAC 

(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:
GCTCCATTGT GTATGAGGCA GCGG 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:
GAGCTCCCGC TGCTGGGTAG CGC 
(2) INFORMATION FOR SEQ ID NO: 107: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

CCTCCGTCCC CACCACGACA ATACG 25 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CTACCCGGGC CACATAACGG GTCACCG 27 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGAGGCCTAC AACGGCCCTG GTGG 24 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
TTCTATCGAT TAAATAGAAT TC 22 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
jjCCATACGCT CACAGCCGAT CCC 23